The effect of highband harmonic structure in the artificial bandwidth expansion of telephone speech

Authors

  • Hannu Pulakka
  • Paavo Alku
  • Laura Laaksonen
  • Päivi Valve
Abstract

The quality of narrowband telephone speech can be improved by artificial bandwidth expansion (ABE), which generates missing frequency components above the telephone bandwidth using only information from the narrowband speech signal. Straightforward bandwidth expansion methods do not reproduce the harmonic structure of voiced sounds properly, but a pitch-adaptive technique can be used to approximate the correct alignment of harmonic frequencies. In this study, pitch-adaptive highband alignment was implemented in an existing ABE method, and the quality of the modified method was studied with formal listening tests in Finnish and Mandarin Chinese. The effect of the highband harmonic structure was found to be unimportant for the perceived speech quality. Consequently, computationally expensive pitch adaptation was found to be unnecessary for the bandwidth expansion of telephone speech.
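The harmonic misalignment mentioned in the abstract can be illustrated with a small numeric sketch. It assumes spectral folding (mirroring the narrowband spectrum around 4 kHz, as happens when an 8 kHz signal is upsampled by zero insertion) as one example of a straightforward expansion method; the abstract does not name the method it has in mind, and the function names and the 300–3400 Hz band limits below are illustrative choices only.

```python
def folded_harmonics(f0, fold_hz=4000.0, low_hz=300.0, high_hz=3400.0):
    """Mirror the narrowband harmonics of f0 around fold_hz.

    Assumption: spectral folding stands in for a 'straightforward'
    expansion method; a component at f reappears at 2 * fold_hz - f.
    """
    k_max = int(high_hz // f0)
    narrow = [k * f0 for k in range(1, k_max + 1) if k * f0 >= low_hz]
    return sorted(2 * fold_hz - f for f in narrow)


def misalignment(f0, freqs):
    """Distance in Hz from each frequency to the nearest multiple of f0."""
    return [abs(f - round(f / f0) * f0) for f in freqs]


if __name__ == "__main__":
    for f0 in (100.0, 110.0):
        high = folded_harmonics(f0)
        worst = max(misalignment(f0, high))
        print(f"f0 = {f0:.0f} Hz: worst highband offset = {worst:.1f} Hz")
```

Under these assumptions, a 100 Hz pitch happens to fold onto exact harmonic positions because 8000 Hz is a multiple of 100 Hz, whereas a 110 Hz pitch leaves every mirrored component 30 Hz away from the nearest true harmonic. A pitch-adaptive method would shift the highband to remove that offset.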

Similar resources

Wideband Speech Recovery from Narrowband Speech Using Classified Codebook Mapping

Speech sounds occupy 8 kHz or more of bandwidth. However, current public telephone networks limit the speech bandwidth to 300–3400 Hz. Telephone speech is characterized by thin and muffled sounds, and degraded speaker identification. We describe an algorithm which generates the missing highband components from the narrowband speech signal. The algorithm is based on three acoustic-phonetic class...

Pseudo-wideband Speech Reconstruction from Telephone Speech

Telephone speech is limited to a bandwidth of 300–3400 Hz. The sound quality is much lower than for broadcast radio and audio compact discs. We present an algorithm to regenerate the missing highband components (3.4–7 kHz). The highband spectrum recovery is based on a Line Spectrum Frequency (LSF) VQ codebook mapping from the narrowband speech to the high frequency components. T...

Highband spectrum envelope estimation of telephone speech using hard/soft-classification

The bandwidth for telephony is generally defined as 300–3400 Hz. This bandwidth restriction has a noticeable effect on speech quality. We present an algorithm which recovers the missing highband parts from telephone speech. We describe an MMSE estimator using hard/soft-classification to create the missing highband spectrum envelope. The classification is motivated by acoustic phonetics:...

Objective analysis of the effect of memory inclusion on bandwidth extension of narrowband speech

For the purpose of improving Bandwidth Extension (BWE) of narrowband speech, we continue our recent work on the positive effect of exploiting the temporal correlation of speech on the dependence between speech frequency bands. We have shown that such memory inclusion into MFCC speech parametrization translates into higher highband certainty. In the work presented herein, we employ VQ to estimat...

Speech enhancement using STC-based bandwidth extension

Telephone speech is typically bandlimited to 4 kHz, resulting in a ‘muffled’ quality. Coding speech with bandwidth greater than 4 kHz reduces this distortion, but requires a higher bit rate to avoid other types of distortion. An alternative to coding wider bandwidth speech is to exploit correlation between the 0-4 kHz and 4-8 kHz speech bands to resynthesize wideband speech from narrowband spee...

Publication date: 2007